DE eng

Search in the Catalogues and Directories

Hits 1 – 10 of 10

1
The Multilingual TEDx Corpus for Speech Recognition and Translation ...
BASE
Show details
2
End-to-end ASR to jointly predict transcriptions and linguistic annotations ...
NAACL 2021 2021; Fujita, Yuya; Omachi, Motoi. - : Underline Science Inc., 2021
BASE
Show details
3
A Corpus for Large-Scale Phonetic Typology ...
BASE
Show details
4
A Corpus for Large-Scale Phonetic Typology ...
BASE
Show details
5
A corpus for large-scale phonetic typology
BASE
Show details
6
A Corpus for Large-Scale Phonetic Typology
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
BASE
Show details
7
Massively Multilingual Adversarial Speech Recognition ...
BASE
Show details
8
Analysis of Multilingual Sequence-to-Sequence speech recognition systems ...
BASE
Show details
9
Low-Resource Contextual Topic Identification on Speech ...
Abstract: In topic identification (topic ID) on real-world unstructured audio, an audio instance of variable topic shifts is first broken into sequential segments, and each segment is independently classified. We first present a general purpose method for topic ID on spoken segments in low-resource languages, using a cascade of universal acoustic modeling, translation lexicons to English, and English-language topic classification. Next, instead of classifying each segment independently, we demonstrate that exploring the contextual dependencies across sequential segments can provide large improvements. In particular, we propose an attention-based contextual model which is able to leverage the contexts in a selective manner. We test both our contextual and non-contextual models on four LORELEI languages, and on all but one our attention-based contextual model significantly outperforms the context-independent models. ... : Accepted for publication at 2018 IEEE Workshop on Spoken Language Technology (SLT) ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.1807.06204
https://arxiv.org/abs/1807.06204
BASE
Hide details
10
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
10
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern